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METHODS FOR ANALYZING LTC4 SYNTHASE POLYMORPHISMS AND DIAGNOSTIC USE 

This invention relates to polymorphisms in the LTC 4 synthase gene. The invention 
also relates to methods and materials for analysing allelic variation in the LTC4 synthase gene, 
5 and to the use of LTC4 synthase polymorphism in the diagnosis and treatment of leukotriene 
mediated diseases such as asthma and allergic rhinitis. 

The cysteinyl-leukotrienes, LTC 4 , LTD 4 and LTE 4 , are potent bronchoconstrictors, 
increase vascular permeability and increase mucus production in airways. They are implicated 
in the pathophysiology of asthma and allergic rhinitis and are found at elevated levels in 
10 bronchoalveolar lavage from asthma patients, particularly after allergen challenge. LTD 4 and 
LTE 4 may also enhance the neurogenic inflammatory response in airways. Compounds which 
inhibit leukotriene synthesis e.g. the 5-lipoxygenase inhibitor, zileuton, or the leukotriene 
~ceptor antagonist, zafirlukast, have been shown to be effective against asthma and rhinitis 
f B Jsse W.W, Clin. Exp. Allergy, 26, 868-879, 1996; see particularly Figure 1 therein which 
15 shows the arachidonic acid cascade, indicating the role of LTC 4 synthase in catalysing the 
formation of LTC 4 ). 

Leukotrienes are derived from membrane phospholipids. Arachidonic acid is released 
from the phospholipid by cytosolic phospholipase A2 and converted to LTA4 by 5- 
lipoxygenase in the presence of 5-lipoxygenase activating protein, FLAP. Polymorphisms in 
20 5-LO have been reported in international patent application WO 97/42347, Brigham & 

Women's Hospital. LTA 4 is conjugated with reduced glutathione by LTC 4 synthase to form 
LTC 4 . The biologically active metabolites, LTD 4 and LTE 4 are formed, following carrier 
mediated export of LTC 4 , by the sequential action of gamma-glutamyl transpeptidase and 
dipeptidases. 

25 The LTC 4 synthase gene has been cloned and published as a 4,465 nucleotide genomic 

sequence comprising 1,446 nucleotides of sequence 5' to the coding sequence, the 5 exons 
and intervening introns and 3' sequence extending 398 nucleotide beyond the polyA signal 
(Penrose et al, J. Biol. Chem., 271, 11356-11361, 1996; EMBL accession no. U50136). 

One approach is. to use knowledge of polymorphisms to help identify patients most 

30 suited to therapy with particular pharmaceutical agents (this is often termed 

"pharmacogenetics") . Pharmacogenetics can also be used in pharmaceutical research to assist 
the drug selection process. Polymorphisms are used in mapping the human genome and to 
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elucidate the genetic component of diseases. The reader is directed to the following 
references for background details on pharmacogenetics and other uses of polymorphism 
detection: Linder etal (1997), Clinical Chemistry, 43, 254; Marshall (1997), Nature 
Biotechnology, 15, 1249; International Patent Application WO 97/40462, Spectra Biomedical; 
5 and Schafer et al (1998), Nature Biotechnology, 16, 33. 

Clinical trials have shown that patient response to treatment with leukotriene 
antagonists is heterogeneous. Thus there is a need for improved approaches to pharmaceutical 
agent design and therapy with leukotriene antagonists. 

The present invention is based on the discovery of five single nucleotide 
10 polymorphisms (SNPs) in the LTC 4 synthase gene. Three SNPs have been found in the 5* 
untranslated region of the gene and two in the first intron, located at positions 375, 815, 1003, 
2169 and 2742 respectively, based on the numbering of U50136. Before our first filing date, 
we believe there has been no disclosure of polymorphism/allelic variation in the LTC 4 
synthase gene. 

15 According to one aspect of the present invention there is provided a method for the 

diagnosis of a single nucleotide polymorphism in LTC 4 synthase in a human, which method 
comprises determining the sequence of the nucleic acid of the human at one or more of 
positions 375, 815, 1003, 2169 and 2742 in the LTC 4 synthase gene as defined by the 
positions in SEQ ID NO: 1, and determining the status of the human by reference to 

20 polymorphism in the LTC4 synthase gene. 

The term human includes both a human having or suspected of having a leukotriene 
mediated disease and an asymptomatic human who may be tested for predisposition or 
susceptibility to leukotriene mediated disease. At each position the human may be 
homozygous for an allele or the human may be a heterozygote. 

25 In one embodiment of the invention preferably the method for diagnosis described 

herein is one in which the single nucleotide polymorphism at position 375 is presence of G 
and/or A. 

In another embodiment of the invention preferably the method for diagnosis described 
herein is one in which the single nucleotide polymorphism at position 815 is presence of C 
30 and/or A. 

In another embodiment of the invention preferably the method for diagnosis described 
herein is one in which the single nucleotide polymorphism at position 1003 is presence of A 
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and/or C Testing for the presence of the C allele at this position is especially preferred 
because, without wishing to be bound by theoretical considerations, of its association with 
increased levels of LTC 4 synthase (as explained herein). 

In another embodiment of the invention preferably the method for diagnosis described 
5 herein is one in which the single nucleotide polymorphism at position 2169 is presence of C 
and/or T. 

In another embodiment of the invention preferably the method for diagnosis described 
herein is one in which the single nucleotide polymorphism at position 2742 is presence of C 
and/or T. 

10 The method for diagnosis is preferably one in which the sequence is determined by a 

method selected from amplification refractory mutation system and restriction fragment length 
polymorphism. 

In another aspect of the invention we provide a method for the diagnosis of leukotriene 
mediated disease, which method comprises: 
15 i) obtaining sample nucleic acid from an individual, 

ii) detecting the presence or absence of a variant nucleotide at one or more of positions 375, 
8 15, 1003 and 2169 in the LTC 4 synthase gene and 

iii) determining the status of the individual by reference to polymorphism in the LTC 4 
synthase gene. 

20 The published sequence of the LTC 4 synthase gene, EMBL accession number U50136, 

is shown in SEQ ID NO: 1 in which the variant sites discovered in the present invention are at 
positions 375, 815, 1003, 2169 and 2742. 

Allelic variation at position 375 consists of a single base substitution from G (the 
published base), for example to A. Allelic variation at position 815 consists of a single base 

25 substitution from C (the published base), for example to A. Allelic variation at position 1003 
consists of a single base substitution from A (the published base), for example to C. Allelic 
variation at position 2169 consists of a single base substitution from C (the published base), 
for example to T. Allelic variation at position 2742 consists of a single base substitution from 
C (the published base), for example to T. The status of the individual may be determined by 

30 reference to allelic variation at one, two, three, four or all five of the above loci. 

Sanak et al (1998), Lancet, 350, 1599, have reported an increased risk of aspirin 
induced asthma (AIA) being associated with the polymorphism at position 1003. This work 
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suggests that the presence of the C allele at position 1003 leads to increased levels of LTC 4 
synthase (see also Cowburn et al (1998), J. Clin. Invest., 101, 834). AIA affects about 10 % 
of adult asthmatics. Aspirin and other cyclo-oxygenase inhibitors cause release of LTs into 
airways, leading to an asthma attack in susceptible individuals. Clinical approaches to deal 
5 with AIA include pretreatment with anti-leukotriene drugs (Szczeklik (1997), Allergy, 52, 
613-9). Commentators have written approvingly of the clinical utility of detection of LTC4 
polymorphisms (Holgate (1998), Lancet, 351, 1300-1301, see last paragraph in particular). 
Anti-leukotriene drugs have been reviewed in the following publications: Horwitz et al. 
(1998), Am J Respir Crit Care Med, 157, 1363 (see particularly Table 1 for a list of drugs); 

10 and Tan (1998), Current Opinion in Pulmonary Medicine, 4, 25. 

The test sample of nucleic acid is conveniently a sample of blood, bronchoalveolar 
lavage fluid, sputum, or other body fluid or tissue obtained from an individual. It will be 
appreciated that the test sample may equally be a nucleic acid sequence corresponding to the 
sequence in the test sample, that is to say that all or a part of the region in the sample nucleic 

15 acid may firstly be amplified using any convenient technique e.g. PCR, before use in the 
analysis of LTC 4 synthase variation. 

It will be apparent to the person skilled in the art that there are a large number of 
analytical procedures which may be used to detect the presence or absence of variant 
nucleotides at one or more of positions 375, 815, 1003, 2169 and 2742 in the LTC 4 synthase 

20 gene. In general, the detection of allelic variation requires a mutation discrimination 

technique, optionally an amplification reaction and a signal generation system. Table 1 lists a 
number of mutation detection techniques, some based on the PCR. These may be used in 
combination with a number of signal generation systems, a selection of which is listed in 
Table 2. Further amplification techniques are listed in Table 3. Many current methods for the 

25 detection of allelic variation are reviewed by Nollau et al., Clin. Chem. 43, 1 1 14-1 120, 1997; 
and in standard textbooks, for example "Laboratory Protocols for Mutation Detection", Ed. by 
U. Landegren, Oxford University Press, 1996 and "PCR", 2 nd Edition by Newton & Graham, 
BIOS Scientific Publishers Limited, 1997. 



30 
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Abbreviations: 



AIA 


Aspirin induced asthma 


ALEX™ 


Amplification refractory mutation system linear extension 


APEX 


Arrayed primer extension 


ARMS™ 


Amplification refractory mutation system 


b-DNA 


Branched DNA j 


CMC 


Chemical mismatch cleavage 


bp 


base pair 


COPS 


Competitive oligonucleotide priming system 


DGGE 


Denaturing gradient gel electrophoresis 


FLAP 


5 -lipoxygenase activating protein 


FRET 


Fluorescence resonance energy transfer 


LCR 


Ligase chain reaction 


5-LO 


5-Lipoxygenase 


LT 


Leukotriene 


MASDA 


Multiple allele specific diagnostic assay 


NASBA 


Nucleic acid sequence based amplification 


OLA 


Oligonucleotide ligation assay 


PCR 


Polymerase chain reaction 


PTT 


Protein truncation test 


RFLP 


Restriction fragment length polymorphism 


SDA 


Strand displacement amplification 


SNP 


Single nucleotide polymorphism 


SSCP 


Single-strand conformation polymorphism analysis 


SSR 


Self sus:ai nea replication 


TGGE 


Temperature gradient gel electrophoresis 



Table 1 - Mutation Detection Techniques 
General: DNA sequencing, Sequencing by hybridisation 
5 Scanning: PTT*, SSCP, DGGE, TGGE, Cleavase, Heteroduplex analysis, CMC, Enzymatic 
mismatch cleavage 
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* Note: not useful for detection of promoter polymorphisms. 
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Hybridisation Based 

Solid phase hyhridisation: Dot blots, MASDA, Reverse dot blots, Oligonucleotide 
5 arrays (DNA Chips) 

Solution phase hybridisation: Taqman™ - US-5210015 & US-5487972 (Hoffmann-La 
Roche), Molecular Beacons - Tyagi et al ( 1996), Nature Biotechnology, 14, 303; WO 
95/13399 (Public Health Inst., New York) 

Extension Based: ARMS™, ALEX™ - European Patent No. EP 332435 B 1 (Zeneca 
10 Limited), COPS - Gibbs et al (1989), Nucleic Acids Research, 17, 2347. 
Incorporation Based: Mini-sequencing, APEX 
Restriction Enzyme Based: RFLP, Restriction site generating PCR 
Ligation Based: OLA 
Other: Invader assay 

15 

Table 2 - Signal Generation or Detection Systems 

Fluorescence: FRET, Fluorescence quenching, Fluorescence polarisation - United Kingdom 
Patent No. 2228998 (Zeneca Limited) 

Other: Chemiluminescence, Electrochemiluminescence, Raman, Radioactivity, Coiorimetric, 
20 Hybridisation protection assay, Mass spectrometry 

Table 3 - Further Amplification Methods 
SSR, NASBA, LCR, SDA, b-DNA 

25 Preferred mutation detection techniques include ARMS™, ALEX™, COPS, Taqman, 

Molecular Beacons, RFLP, and restriction site based PCR and FRET techniques. 

Particularly preferred methods include ARMS™ and RFLP based methods. ARMS™ 
is an especially preferred method. 

In a further aspect, the diagnostic methods of the invention are used to assess the 
30 efficacy of therapeutic compounds in the treatment of asthma, rhinitis and other leukotriene 
mediated diseases. The polymorphisms identified in the present invention occur in the 5' 
untranslated region and the first intron of the LTC 4 synthase gene, regions which are of 
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importance in the control of gene transcription and gene translation. Furthermore, each of the 
variant positions is located within a known transcription factor binding site; it is believed that 
substitution of A at variant position 375 modifies an AP-2 CS4 transcription factor binding 
site, substitution of A at variant position 815 modifies an AP-2 CS5 transcription factor 
5 binding site, substitution of C at variant position 1003 modifies the glucocorticoid receptor 
binding site GGGACA and substitution of T at variant position 2169 disrupts an MREc-(3) 
transcription factor binding site. 

Example 3 below describes another polymorphism which is substitution of T for C at 
position 2742. This variant disrupts a RIPE3b site (Shieh and Tsai, J. Biol. Chem. 266, 

10 16708-16714, 1991). 

Assays, for example reporter-based assays, may be devised to detect whether one or 
more of the above polymorphisms affect transcription levels and/or message stability. 

Individuals who carry particular allelic variants of the LTC 4 synthase gene may 
therefore exhibit differences in their ability to regulate enzyme biosynthesis under different 

15 physiological conditions and will display altered abilities to react to different diseases. In 
addition, differences in enzyme regulation arising as a result of allelic variation may have a 
direct effect on the response of an individual to drug therapy. LTC4 synthase polymorphism 
may therefore have the greatest effect on the efficacy of drugs designed to modulate the 
activity of LTC 4 synthase or other components of the leukotriene pathway. However, the 

20 polymorphisms may also affect the response to agents acting on other biochemical pathways 
regulated by leukotrienes. The diagnostic methods of the invention may therefore be useful 
both to predict the clinical response to such agents and to determine therapeutic dose. 

In a further aspect, the diagnostic methods of the invention, are used to assess the 
predisposition and/or susceptibility of an individual to diseases mediated by leukotrienes. 

25 LTC 4 synthase polymorphism may be particularly relevant in the development of asthma and 
other inflammatory diseases such as allergic rhinitis and the present invention may be used to 
recognise individuals who are particularly at risk from developing these conditions. 

In a further aspect, the diagnostic methods of the invention are used in the 
development of new drug therapies which selectively target one or more allelic variants of the 

30 LTC 4 synthase gene. Identification of a link between a particular allelic variant and 
predisposition to disease development or response to drug therapy may have a significant 
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impact on the design of new drugs. Drugs may be designed to regulate the biological activity 
of variants implicated in the disease process whilst minimising effects on other variants. 

In a further diagnostic aspect of the invention the presence or absence of variant 
nucleotides is detected by reference to the loss or gain of sites recognised by restriction 
5 enzymes. In the accompanying Example 1 we provide details of convenient sites that are lost 
or gained as a result of LTC 4 synthase gene polymorphisms. The person of ordinary skill will 
be able to design and implement diagnostic procedures based on the detection of restriction 
fragment length polymorphism due to the loss or gain of one or more of the sites listed in 
Examples 1 or 3. 

10 In yet a further aspect the invention provides a variant of the LTC4 synthase gene 

comprising one or more of the specific polymorphisms at positions 375, 815, 1003 and 2169. 

Further aspects of this invention comprise the 5* untranslated region of the LTC 4 
synthase gene comprising a polymorphism at one or more of positions 375, 815 and 1003. In 
particular the polymorphism at position 375 is G to A. In particular the polymorphism at 

15 position 815 is C to A. In particular the polymorphism at position 1003 is A to C Another 
aspect of this invention comprises the first intron of the LTC4 synthase gene comprising a 
polymorphism at position 2169; in particular this polymorphism is C to T. 

According to another aspect of the present invention there is provided a nucleic acid 
comprising the 5' untranslated region of LTC 4 synthase comprising a polymorphism 

20 corresponding with one or more of positions 375, 815 and 1003 as defined by the positions in 
SEQ ID NO: 1 and in which there is an A at position 375, an A at position 815 and a C at 
position 1003. The 5' untranslated region of LTC 4 synthase is defined as positions 1-1446 of 
SEQ ID NO: 1. Fragments of the 5' untranslated region comprising at least one of these 
allelic variants are also within the scope of the invention. 

25 Fragments are at least 17 bases, more preferably at least 20 bases, more preferably at 

least 30 bases. Complementary strands are also within the scope of the invention. 

According to another aspect of the present invention there is provided a nucleic acid 
comprising the first intron of the LTC4 synthase gene comprising a polymorphism at one or 
more of positions 2169 and 2742 as defined by the position in SEQ ID NO: 1 and in which 

30 there is a T at position 2169 and there is a T at position 2742. The first intron of the LTC 4 
synthase gene is defined as positions 1505-2949 of SEQ ID NO: 1. Fragments of the first 
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intron comprising at least one of these allelic variants are also within the scope of the 
invention. 

The invention further provides nucleotide primers which detect the LTC 4 synthase 
gene polymorphisms of the invention. 
5 According to another aspect of the present invention there is provided a diagnostic 

nucleic acid primer capable of detecting a LTC 4 synthase gene polymorphism at one or more 
of positions 375, 815, 1003, 2169 and 2742 in the LTC 4 synthase gene as defined by the 
positions inSEQIDNO: 1. 

A diagnostic nucleic acid primer is defined as an allele specific primer, used, generally 

10 together with a constant primer, in an amplification reaction such as a PCR reaction, which 
provides the discrimination between alleles through selective amplification of one allele at a 
particular sequence position e.g. as used for ARMS™ assays, see Example 2 herein. The 
diagnostic primer is preferable 17- 50 nucleotides, more preferably about 17-35 nucleotides, 
more preferably about 17-30 n \c\ cotides. 

1 5 We provide diagnostic primers comprising the sequences set out below as well as 

derivatives thereof wherein about 6-8 of the nucleotides at the 3' terminus are identical to the 
sequences given below and wherein up to 10, such as up to 8, 6, 4, 2, or 1 of the remaining 
nucleotides may be varied without significantly affecting the properties of the diagnostic 
primer. Conveniently, the sequence of the diagnostic primer is as written below, or more 

20 preferably as described in Example 2 below. The diagnostic primer is preferably 17- 50 
nucleotides, more preferably about 17-35 nucleotides, more preferably about 17-30 
nucleotides. 



Primer 


Allelic variant 


Diagnostic (Allele Specific) Primer sequence 


number* 


detected 




i 


375 A 


GGGGCGGCCGGGGGCGCTCCAGGCGGGGCA 


2 


815 A 


CTTGGACAGGTTTCCTCCTGGCAGGGTGGA 


3 


1003 C 


GGGTTGCCAGGAACAGCCTGGATGGGGACC 


4 


2169 T 


ATGGTCCGACGGGAGGTCTGGGGAGGGAGT 


5 


375 A 


CTCCTGCCTGGAGTTCTGGGTGTCTCCCTT 


6 


815 A 


TAGTCGTTGTAGGGGTTCCATGCACAAGGT 


7 


1003 C 


TAACTCCTCCACCCACCTTATCTGTTCCCG 


8 


2169 T 


GACCACACACAGACCAGTGCTGGCTGTGCA 



* Primers 1-8 are represented as SEQ ID NO: 2-9 respectively. 
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The primers may be manufactured using any convenient method of synthesis. 
Examples of such methods may be found in standard textbooks, for example "Protocols for 
Oligonucleotides and Analogues; Synthesis and Properties," Methods in Molecular Biology 
Series; Volume 20; Ed. Sudhir Agrawal, Humana ISBN: 0-89603-247-7; 1993; 1 st Edition. If 
5 required the primer(s) may be labelled to facilitate detection. 

According to another aspect of the present invention there is provided an allele- 
specific oligonucleotide probe capable of detecting a LTC4 synthase gene polymorphism at 
one or more of positions 375, 815, 1003, 2169 and 2742 in the LTC 4 synthase gene as defined 
by the positions in SEQ ID NO: 1. 
10 The allele-specific oligonucleotide probe is preferably 17- 50 nucleotides, more 

preferably about 17-35 nucleotides, more preferably about 17-30 nucleotides. 

The design of such probes will be apparent to the molecular biologist of ordinary skill. 
Such probes are of any convenient length such as up to 50 bases, up to 40 bases, more 
conveniently up to 30 bases in length, such as for example 8-25 or 8-15 bases in length. In 
15 general such probes will comprise base sequences entirely complementary to the 

corresponding wild type or variant locus in the LTC4 gene. However, if required one or more 
mismatches may be introduced, provided that the discriminatory power of the oligonucleotide 
probe is not unduly affected. The probes of the invention may carry one or more labels to 
facilitate detection. 

20 According to another aspect of the present invention there is provided a diagnostic kit 

comprising a diagnostic primer of the invention and/or an allele-specific oligonucloetide 
primer of the invention. 

The diagnostic kits may comprise appropriate packaging and instructions for use in the 
methods of the invention. Such kits may further comprise appropriate buffer(s) and 

25 polymerase(s) such as thermostable polymerases, for example taq polymerase. 

The LTC 4 synthase gene has been mapped to chromosome 5q35 (Penrose et aU J. Bio!. 
Chem. 271, 11356-1 1361, 1996). In another aspect of the invention, the single nucleotide 
polymorphisms of this invention may be used as genetic markers for this region in linkage 
studies. This particularly applies to the polymorphism at 1003 because of its relatively high 

30 frequency, (Krugylak, Nature Genetics, 17, 21-24, 1997). 
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According to another aspect of the present invention there is provided a method of 
treating a human in need of treatment with an antileukotriene drug in which the method 
comprises: 

i) diagnosis of a single nucleotide polymorphism in LTC 4 synthase in the human, which 

5 diagnosis comprises determining the sequence of the nucleic acid at one or more of positions 
375, 815, 1003, 2169 and 2742 in the LTC 4 synthase gene as defined by the positions in SEQ 
ID NO: 1 , and determining the status of the human by reference to polymorphism in the LTC 4 
synthase gene; and 

ii) administering an effective amount of an antileukotriene drug. 

10 Preferably determination of the status of the human is clinically useful. Examples of 

clinical usefulness include deciding which antileukotriene drug or drugs to administer and/or 

in deciding on the effective amount of the drug or drugs. 

According to another aspect of the present invention there is provided use of an 

antileukotriene drug in preparation of a medicament for treating a leukotriene mediated 
15 disease in a human diagnosed as having a single nucleotide polymorphism at one or more of 

positions 375, 815, 1003, 2169 and 2742 in LTC 4 synthase gene as defined by the positions in 

SEQ ID NO: 1. 

According to another aspect of the present invention there is provided a 
pharmaceutical pack comprising an antileukotriene drug and instructions for administration of 
20 the drug to humans diagnostically tested for a single nucleotide polymorphism at one or more 
of positions 375, 815, 1003, 2169 and 2742 in LTC 4 synthase gene as defined by the positions 
in SEQ ID NO: 1. 

Suitable antileukotriene drugs include leukotriene D 4 receptor antagonists, FLAP 
antagonists and 5-lipoxygenase inhibitors (see particularly Table 1 in the following 
25 publication for a list of drugs, Horwitz et al. (1998), Am J Respir Crit Care Med, 157, 1363), 
preferably leukotriene D 4 receptor antagonists, more preferably montelukast and zafirlukast, 
and of these zafirlukast is most preferred. 

Testing for the presence of the C allele at position 1003 is especially preferred 
because, without wishing to be bound by theoretical considerations, of its association with 
30 increased levels of LTC 4 synthase (as explained herein). 

The invention will now be illustrated but not limited by reference to the following 
Examples. All temperatures are in degrees Celsius. 
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In the Examples below, unless otherwise stated, the following methodology and 
materials have been applied. 

AMPLITAQ™, available from Perkin-Elmer Cetus, is used as the source of 
thermostable DNA polymerase. 
5 General molecular biology procedures can be followed from any of the methods 

described in "Molecular Cloning - A Laboratory Manual" Second Edition, Sambrook, Fritsch 
and Maniatis (Cold Spring Harbor Laboratory, 1989). 

Electropherograms were obtained in a standard manner: data was collected by ABI377 
data collection software and the wave form generated by ABI Prism sequencing analysis 
10 (2.1.2). 

Example 1 

Identification of Polymorphisms 
1. Methods 

15 DNA Preparation 

DNA was prepared from frozen blood samples collected in EDTA following protocol I 
(Molecular Cloning: A Laboratory Manual, p392, Sambrook, Fritsch and Maniatis, 2 nd 
Edition, Cold Spring Harbor Press, 1989) with the following modifications. The thawed 
blood was diluted in an equal volume of standard saline citrate instead of phosphate buffered 

20 saline to remove lysed red blood cells. Samples were extracted with phenol, then 

phenol/chloroform and then chloroform rather than with three phenol extractions. The DNA 
was dissolved in deionised water. 
Template Preparation 

Templates were prepared by PCR using the oligonucleotide primers and annealing 

25 temperatures set out below. The extension temperature was 72° and denaturation temperature 
94°. Generally 50 ng of genomic DNA was used in each reaction and subjected to 40 cycles 
of PCR. 



Fragment 


Forward 
Oligonucleotide 


Reverse 
Oligonucleotide 


Annealing 
Temperature 


Time 


% 
DMSO 


62-1043 


62-87 


1021-1043 


62° 


60s 


5 
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271-407 


271-291 


388-407 


60° 


45 s 


0 


417-1043 


417-437 


1021-1043 


60° 


45 s 


10 


851-1824 


851-874 


1801-1824 


62° 


60s 


5 


1503-2400 


1503-1524 


2379-2400 


65° 


60s 


10 



For dye-primer sequencing these primers were modified to include T7 and SP6 primer 
sequences (ABI protocol P/N 4021 14, Applied Biosystems) at the 5* end of the forward and 
reverse oligonucleotides respectively. 
5 Chemical Mismatch Cleavage (CMC) 

CMC was carried out as decribed by Rowley et al (Genomics 30, 574-582, 1995) 
using internal labelling of probe and target with fluorescent dyes (RG6 or Rl 10). 6 % 
Acrylamide gels were run on an automated DNA sequencer (ABI 377, Applied Biosystems) 
on 12 cm plates (under module GS 12-2400 A) and analysed with suitable software (ABI 
10 GeneScan™2.1). 
Dye Primer Sequencing 

Dye-primer sequencing using T7 and SP6 primers was as described in the ABI 
protocol P/N 4021 14 for the ABI Prism™ dye primer cycle sequencing core kit with 
"AmpliTaq FS"™ DNA polymerase, modified in that the annealing temperature was 45° and 
15 DMSO was added to the cycle sequencing mix to a final concentration of 5 %. 

The extension reactions for each base were pooled, ethanol/sodium acetate 
precipitated, washed and resuspended in formamide loading buffer. 

4.25 % Acrylamide gels were run on an automated sequencer (ABI 377, Applied 
Biosystems). 

20 

2. Results 

All positions are based on the U50136 numbering. 
Variant Position 375 

CMC analysis of fragment 1 (62-1043) produced cleavage products of approximately 
25 300 bp and 670 bp in 9/49 subjects. Dye-primer sequence analysis of fragment 1 from 2 
subjects showing this pattern revealed a substitution of A for G at position 375. This was 
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confirmed by sequencing 6 clones of fragment 1 from one of these subjects; 5/6 had A at 
position 375 and 1/6 had G. 

Substituting A for G at position 375 modifies a Mnl I site at position 368. 
PCR products from LTC 4 synthase position 271 to 407 from 49 subjects were digested with 
5 Mnl I. This product contains an invariant Mnl I site at position 335 giving an invariant 61 bp 
fragment and a polymorphic fragment, 72 bp in the absence of site 368 or 33 and 39 base pairs 
if the Mnl I site at 368 is present. 9/49 subjects gave both the 72 bp and 33/39 bp products 
indicating that the Mnl I site at position 368 was lost from one allele and these subjects were 
heterozygous at position 375. The frequency of the A allele at 375 is thus 9/98. 
10 Additional RFLPs generated by this variant are loss of an M.CviA IV and M.Sss I site. 

This variant modifies a transcription factor binding site AP-2 CS4. 
Variant Position 815 

CMC analysis of fragment 1 produced cleavage products of approximately 230 bp and 
750 bp in 2/49 subjects. Dye-primer sequence analysis of fragment 1 from 1 subject showing 
15 this pattern revealed a substitution of A for C at position 815. This was confirmed by 

sequencing 8 clones of fragment 1 from this subject. 4/8 had A at position 815 and 4/8 had C. 
Substituting A for C at position 815 generates an Ava II site at 813. 
PCR products from LTC 4 synthase position 417 to 1043 from 53 subjects were 
digested with Ava II. In 5/53 subjects an Ava II site at position 815 was created. These 
20 subjects were heterozygous at position 815. The frequency of the A allele was thus 5/106 
alleles. 

Additional RFLPs generated by this variant are loss of Aca I, CviK I, M.CviA IV, 
CviJ I and Hae III sites and gain of Asp697 I, VpaKl 1 A I and Sin I sites. 

This variant modifies a transcription factor binding site: AP-2 CS5. 
25 Variant Position 1003 

CMC analysis of fragment 1 produced a cleavage product of approximately 940 bp in 
22/49 subjects. CMC analysis of fragment 2 (851-1824) produced a band of approximately 
800 bp. Dye-primer sequence analysis of fragment 2 from 24 subjects showing this pattern 
revealed a substitution of C for A at position 1003. This was confirmed by sequencing 14 
30 clones of fragment 1 from 2 subjects with the 940 bp cleavage product. 6/14 had C at 
position 1003 and 8/14 had A. 

Substituting C for A at position 1003 generates an Ava II site at position 999. 
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PCR products from LTC4 synthase position 417 to 1043 from 53 subjects were digested with 
Ava II. In 26/53 subjects an Ava II site at position 1003 was created. One of these subjects 
was homozygous C/C at position 815 and 25 were heterozygous C/A. The frequency of the C 
allele was thus 27/106 alleles. 
5 The 1003 C variant is not on the same chromosome as the 815 A variant. 

Additional RFLPs generated by this variant are gain of sites for Bcr I, AhaB I, Asp697 
I, VpaKl 1 A I, Asu I, Fmu I, Sau96 I, Sin I, Nla IV, Asp L Asp748 I, BsaC I, Dsa V, Ecol831 
I, Hin2 I, Hpa II, Msp I, Ben I, Nci I, ScrF I and M.Sss I. 

This variant modifies the glucocorticoid receptor binding site GGGACA, (Chan et aL y 
10 J. Biol. Chem. 266, 22634-22644, 1991). 

Sanak et al. (1998), Lancet, 350, 1599, have reported an increased risk of aspirin 
induced asthma (AIA) being associated with this polymorphism (Sanak's position -444 is 
equivalent to our position 1003). AIA affects about 10 % of adult asthmatics. Aspirin and 
other cyclo-oxygenase inhibitors cause release of LTs into airways, leading to an asthma 
1 5 attack. Clinical approaches to deal with AIA include pretreatment with anti-leukotriene drugs 
(Szczeklik (1997), Allergy, 52, 613-9). Commentators have written approvingly of the 
clinical utility of detection of LTC 4 polymorphisms (Holgate (1998), Lancet, 351, 1300-1301, 
see last paragraph in particular). 
Variant Position 2169 

20 Dye-primer sequencing of fragment 6 (1503-2400) from 47 subjects demonstrated a 

substitution of T for C at position 2169 in 3 subjects. 

Substituting T for C at position 2169 generates an Apa LI site at position 647. 

Fragment 6 was digested with ApaL I. In 3/54 subjects an Apa LI site was created. 
These subjects were heterozygous for the RFLP, thus the frequency of the T allele at position 
25 2169 is 3/108 alleles. 

Additional RFLPs generated by this variant are loss of M.Cvi A IV, Bca I, Hinp I, 
Hinpl I, Cfo I, Hha I and M.Sss I sites and gain of Aaq I, Bkal 125 I, BsaG I, CviR I, BsiHKA 
1, Bsp 1286 1, Hgi A I and Nsp II sites. 

This variant disrupts an MREc-(3) site (Labbe et al, Nuc. Acid Res. 19, 4225-423 1 

30 1991). 
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Example 2 

Detection of Variants 375, 815, 1003 and 2169 using ARMS™, 

The following primers were used in ARMS™ PGR to distinguish allelic variants at 
positions 375,815, 1003 and 2169 of the LTC 4 synthase gene. 



Allelic 
Variant 
Detected 


Allele Specific Primer 
Sequence** 


Constant Primer Sequence*** 


375 G 
A 


GGAGTTCTGGGTGTCTCCATC 
GGAGTTCTGGGTGTCTCCATT 


GGTC AGT CTGGACTTTGC C AC 


815 C 
C 
A 
A 


TAGGGGTTCCATGCACAAGGG 
TAGGGGTTCCATGCACAATGG 
TAGGGGTTCCATGCACAAGGT 
TAGGGGTTC C ATGC ACAATGT 


TTGTTACCTTGAGGCAAGAGG 


1003 A 
C 

A 

C 


CACCCACCTTATCTGTTCCCT 
CACCCACCTTATCTGTTCCCG 

GGAACAGC CTGGATGGGGTC A 
GGAACAGCCTGGATGGGGTCC 


AGGCTGGCAGGCATGAGGTTT 
TTCGTGCCCCTTCCTTGCCTA 


2169 C 
T 


CAGACCAGTGCTGGCTGTACG 
CAGACCAGTGCTGGCTGTACA 


CTCCAGCTGCTCCTGCACTGA 



** These primers are represented by SEQ ID NO: 10-21 respectively. 
*** These primers are represented by SEQ ID NO: 22-26 respectively. 

10 

Genomic DNA (50 ng) was amplified for 35 cycles with the above pairs of primers. 
The annealing temperatures were 62°, 60°, 64° and 60° for 375, 815, 1003 and 2169 
respectively. 

Homozygotes for the less common allele were only available for position 1003. The 
15 above primers and conditions would distinguish A/A, A/C and C/C genotypes at position 
1003. 

375 A/ A homozygotes were not available so it could not be demonstrated that the G 
specific primer would not recognise A/A homozygotes but the A specific primer did not 
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recognise G/G homozygotes. 815 A/A homozygotes were not available so it could not be 
demonstrated that the C specific primer would not recognise A/A homozygotes but the A 
specific primer did not recognise C/C homozygotes. 2169 T/T homozygotes were not 
available so it could not be demonstrated that the C specific primer would not recognise T/T 
5 homozygotes but the T specific primer did not recognise C/C homozygotes. 

Ex ample 3 

Polymorphism at Position 2742 

Dye primer sequencing, as described in Example 1, of fragment 2180-2972 from 5 



10 subjects demonstrated a substitution of T for C at position 2742 in 2 subjects. Template was 
prepared as described in Example 1 using the conditions set out below. 



! Fragment 


Forward 
Oligonucleotide 


Reverse 
Oligonucleotide 


Annealing 
Temperature 


Time 


% DMSO 


2180-2973 


2180-2200 


2953-2973 


65° 


30 s 


10 



This variant disrupts a RIPE3b site (Shieh andTsai, J. Biol. Chem. 266, 16708-16714, 

1991). 
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CLAIMS 

1 . A method for the diagnosis of a single nucleotide polymorphism in LTC4 synthase in a 
human, which method comprises determining the sequence of the nucleic acid of the human at 

5 one or more of positions 375, 815, 1003, 2169 and 2742 in the LTC 4 synthase gene as defined 
by the positions in SEQ ID NO: 1, and determining the status of the human by reference to 
polymorphism in the LTC 4 synthase gene. 

2. A method according to claim 1 in which the single nucleotide polymorphism at 
position 375 is presence of G and/or A. 

10 3, A method according to claim 1 in which the single nucleotide polymorphism at 
position 815 is presence of C and/or A. 

4. A method according to claim 1 in which the single nucleotide polymorphism at 
position 1003 is presence of A and/or C. 

5. A method according to claim 1 in which the single nucleotide polymorphism at 
15 position 2169 is presence of C and/or T. 

6. A method according to claim 1 in which the single nucleotide polymorphism at 
position 2742 is presence of C and/or T. 

7. A method according to any one of claims 1-6 in which the sequence is determined by a 
method selected from amplification refractory mutation system and restriction fragment length 

20 polymorphism. 

8. A nucleic acid comprising the 5' untranslated region of LTC 4 synthase comprising a 
polymorphism corresponding with one or more of positions 375, 815 and 1003 as defined by 
the positions in SEQ ID NO: 1 and in which there is an A at position 375, an A at position 815 
and a C at position 1003. 

25 9. A nucleic acid comprising the first intron of the LTC 4 synthase gene comprising a 
polymorphism at one or more of positions 2169 and 2742 as defined by the position in SEQ 
ID NO: 1 and in which there is a T at position 2169 and there is a T at position 2742. 
10. A diagnostic nucleic acid primer capable of detecting a LTC 4 synthase gene polymorphism 
at one or more of positions 375, 815, 1003, 2169 and 2742 in the LTC 4 synthase gene as 

30 defined by the positions in SEQ ID NO: L 
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11. An allele-specific oligonucleotide probe capable of detecting a LTC 4 synthase gene 
polymorphism at one or more of positions 375, 815, 1003, 2169 and 2742 in the LTC 4 
synthase gene as defined by the positions in SEQ ID NO: 1 . 

12. A diagnostic kit comprising a diagnostic primer as defined in claim 10 and/or an 
5 allele-specific oligonucleotide primer as defined in claim 1 1. 

13. A method of treating a human in need of treatment with an antileukotriene drug in 
which the method comprises: 

i) diagnosis of a single nucleotide polymorphism in LTC 4 synthase in the human, which 
diagnosis comprises determining the sequence of the nucleic acid at one or more of positions 

10 375, 815, 1003, 2169 and 2742 in the LTC 4 synthase gene as defined by the positions in SEQ 
ID NO: 1, and determining the status of the human by reference to polymorphism in the LTC 4 
synthase gene; and 

ii) administering an effective amount of an antileukotriene drug. 

14. Use of an antileukotriene drug in preparation of a medicament for treating a leukotriene 
15 mediated disease in a human diagnosed as having a single nucleotide polymorphism at one or 

more of positions 375, 815, 1003, 2169 and 2742 in LTC 4 synthase gene as defined by the 
positions in SEQ ID NO: 1. 
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SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

5 

(i) APPLICANT: 

(A) NAME: Zeneca Limited 

(B) STREET: 15 Stanhope Gate 

(C) CITY: London 
10 (D) STATE: England 

(E) COUNTRY: United Kingdom 

(F) POSTAL CODE { ZIP) : W1Y 6LN 

(G) TELEPHONE: 0171 304 5000 
<H) TELEFAX: 0171 304 5151 

15 (I) TELEX: 0171 304 2042 



<ii) TITLE OF INVENTION: METHODS 



20 



(iii) NUMBER OF SEQUENCES: 2 6 



(iv) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC -DOS /MS-DOS 

25 (D) SOFTWARE: Patentln Release #1.0, Version #1.30 (EPO) 



(2) INFORMATION FOR SEQ ID NO: 1: 



30 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4465 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY; linear 



35 



(ii) MOLECULE TYPE: DNA (genomic) 

(xl) SEQUENCE DESCRIPTION: SEQ ID NO : 1: 



GAGCTCACAG AGCCCCCAGC TGGGGCATAT CTGGTTTCCG GGGGCAGGGG CGATACCCAG 60 

40 AGGAGGAAGA AGGGATTCTG AG AG AGCC C A ACAGGCTCCG AGCCTCAGGC TGGAGCTGAG 120 

CTTGGGGCAG CAAGGAAGGA CCAGGTGCGA GGGCAGAACC ATGCGGCCCG ACCCCTGCAG 180 

CACGGCCTGT GGCCTCCCCC AGCTCCTGCC CGTGCTTCTG GGTCAGTCTG GACTTTGC C A 24 0 

CTTCTGACCA AAAGCCACCG CAAACCCACT CAAGCCAAAA GAGGAAGTGA CCGTTAGGCC 3 00 

CAACTGGGAA GGCTGGCGGC CAGGGGCACT CCAGGCAGGG CGAGGGGGGC GGCCGGGGGC 3 60 

45 GCTCCAGGCG GGGCGAGGGA GACACCCAGA ACTCCAGGCA GGAGTCCTCG GGTGCCACCT 42 0 

TTCCTCTCCA CCTGGCCCTG CGTGGGCTCT GTCCTCAGGG TGGCCCGCCG TAGTCCCCCT 480 

CCCCACTCTG AGTTTCCTGT CCCAAAGTCC TAAGGAAGTT TCCAGAACTA CATCTCACCA 54 0 
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TCTTGAGTCA 


GCCTTGGCTC 


AGTGTCCATC 


TCACAGGCCT 


GGAAGGGGCA 


GGAGTCAGCA 


600 


CTGTCCAGAC 


CACAGGGCCT 


GAGTGTGGGG 


AGGGCAGCCG 


TCTAGGAAGG 


TGGTGGAGGG 


660 


TTGTTACCTT 


GAGGCAAGAG 


GGCTGCGGGG 


CAGAAAGACA 


CAGCAGGTGA 


CTGTTGTGGG 


720 


AGGCCCAAGA 


GAGGCCTGGG 


AGAGGGATGG 


CCCACAAGGG 


CTGACCCTCC 


CGCCACCCAG 


780 


GGGGCCTTGG 


ACAGGTTTCC 


TCCTGGCAGG 


GTGGCCCTTG 


TGCATGGAAC 


CCCTACAACG 


840 


ACTAAGGCTG 


GCAGGCATGA 


GGTTTCCTGA 


AGGAGAAAGA 


GCTTGTGGGG 


CCCAGTGTGG 


900 


CTGGGGGGGC 


GCTGGGACTC 


CATTCTGAAG 


C C AAAGGC AC 


TGGGAAGGGC 


TTCCGCAGAG 


960 


GAGGGTTTGG 


CAGGGGTTGC 


CAGGAACAGC 


CTGGATGGGG 


ACAGGGAACA 


GATAAGGTGG 


1020 


GTGGAGGAGT 


TAGC CGGGAG 


CCTGGGGCTG 


GCTCCAGCAT 


GATGTGGGGG 


TCTGCAAGGC 


1080 


C C TGG AG AAA 


GTGGGGTGGT 


GCAGCAGGGG 


GCACACCCAC 


AGCTGGAGCT 


GACCCAGATG 


1140 


GACAGCTTGG 


GCTCTGCCAC 


GCGGGACTAG 


GCAAGGAAGG 


GGCACGAACA 


AGCAGGAAGT 


1200 


GGTGAGGCGG 


TCTCCAGCTA 


GCTGCTCTCC 


CCTGCCCAGA 


CTTTGGTTTC 


CTCCCTGCTG 


1260 


GCTTGGCCTG 


GCTCCCTGGC 


TCTGTGTGGT 


ATGGTCACAC 


CCCCGTGCAC 


CCCCTCCACT 


1320 


GAGATGGGGC 


GGGG AGAGC A 


CCGAGGCTGC 


TCTTCCTCTC 


CTGGGCCGTC 


CTCTGAGCAG 


1380 


CAGACGGGGC 


TAAGCGTTCC 


CCAGCTCGCC 


TTCACACACA 


GCCCGTGCCA 


CCACACCGAC 


1440 


GGTACCATGA 


AGGACGAGGT 


AGCTCTACTG 


GCTGCTGTCA 


CCCTCCTGGG 


AGTCCTGCTG 


1500 


CAAGGTGGGC 


TGGTTCCTAT 


CTAGGAAGAG 


GGTGGGCC TT 


AGATCCCTAC 


AGCTTGCCCT 


1560 


CTGCCCCCTA 


GGCCCAGGTG 


GAGGGCAGAG 


GTGGGGACTC 


CAGCCCAGGC 


CCAAGCTGGA 


1620 


AGAGGGTGGG 


GACTTTCAGG 


GAACTGGGGG 


GCACCTGGCT 


GTGAGAGCTG 


TAGGACTTGG 


1680 


GGGTGGCAAG 


GGTGCCAGGA 


CAAATGGTAG 


GATAGCCATG 


GGC TTGGGGA 


AGCTGATCTC 


1740 


TGCTCTTTCC 


AGCTGTCCCC 


TCTCTGGGCG 


TCCCAGCAAG 


CGGCCCCCAT 


TCCCTGGCTC 


1800 


TGCTTCAAAG 


GCACCTCCAT 


ACTGGGACCA 


CGTGGAGCAG 


GGTAGAGGTG 


GGACTCCTTC 


1860 


CTCCAGCCCC 


CTAAAAAGAG 


CCTGCTTAAT 


GCCTTTCTCA 


GACTGGCCCT 


AAAGGACACA 


1920 


TTCCTTGGCC 


AGATATCCTT 


GCCACCTAAG 


AGACACCACT 


ACTCCACAGT 


GTGTGGGCTA 


1980 


GGATAAGGCA 


CAGCCTGGGG 


AGGGGGCTCT 


GAAGGGGCTG 


AACAGACAGG 


CCAGCCTGAC 


2040 


CTCCAGCTGC 


TCCTGCACTG 


AGCTGGATGG 


CCACCCTGTG 


ACACCCATCT 


GCAGAGGGCC 


2100 


CAGAACCAAA 


GGTGCCAGGG 


CTG C AGG ACT 


CAGGGGGAGA 


TGGTCCGACG 


GGAGGTCTGG 


2160 


GGAGGGAGCG 


CACAGCCAGC 


AC TGGTCTGT 


GTGTGGTCTG 


GCCTGGCCTC 


AC CTG AC C AA 


2220 


G AG AAGGGC T 


CCTGCCCACA 


GAGAAACTTT 


AGGGCCAGCC 


CACCCTCTGC 


AACTACCCCA 


2280 


GCCCTGGGGT 


CCTGGGGTTA 


GGCTAGGAGA 


GTCCCAGCTG 


CAACCTCCTG 


GGAGCAGGAG 


2340 


AGAAGGTGTC 


TGTCAGATTT 


AGGCCTGGGA 


CCGGAATGCA 


GGAACAGAGA 


AACTGAGGTT 


2400 


TGGAGGCACA 


GGGACGCAGG 


CTTTAGTGAT 


CCCGGCCTGA 


GGCAGGGTCA 


GAGGGCCCTG 


2460 


CTGGTGGGCG 


CTGGTAGGTG 


GGTG AC C AGG 


GACTGTTAGC 


TACAGGGAGT 


GTGCTTCCTT 


2520 


GCACCTGGGA 


GGATGCAGCC 


AGCTCTGCCC 


TCAGACTCCC 


GAGGCACTTC 


CTGGCCAGGG 


2580 


ACCTGAAAGC 


TGCATTTGCC 


TGTGTTTTGA 


GAGTGAAATG 


ATTCAGAAAC 


AAGGACTCAA 


2640 


GTGGTCTCTC 


TCGCGGAGCA 


GGTGTCCCTG 


TGCCTGAATC 


ACTCACCCTC 


CCCCATACAC 


2700 


TCACAGGTTG 


GGACAGGGCC 


TCTCTGCGCC 


CCAGGCTTCA 


GCCCTGCCCT 


CCTCGCTGAA 


2760 


TGTCAGGGAC 


ACAGGGCAGG 


CCAGGGATGG 


GTGAGACGAG 


AGGTCTCCTC 


GGGCGGGGAG 


2820 


GGGGCGGGGT 


TCCGCCTTAG 


GGAGGAGAGG 


ACACGGCCAA 


GTGAAGGGCC 


AGATTGCAGG 


2880 


ATCCCTCCCA 


CTCCCATCTC 


TGGGGCTTCG 


GGTGTCCAGA 


CCTGACTCCC 


GCTCCCCCTC 


2940 


CTCCCCCAGC 


CTACTTCTCC 


CTGCAGGTGA 


TCTCGGCGCG 


CAGGGCCTTC 


CGCGTGTCGC 


3000 


CGCCGCTCAC 


CACCGGCCCA 


CCCGAGTTCG 


AGCGCGTCTA 


CCGAGCCCAG 


TGAGGCGCGG 


3060 


CGGGAGGGCG 


CGGGGCGGGG 


AGCGAGCCCC 


AGGCGGGTCC 


GGGTCGCAGG 


ACCATCCCGG 


3120 


CCGGCGCGCT 


CATCCCACCC 


GCCCACCGCA 


GGGTGAACTG 


CAGCGAGTAC 


TTCCCGCTGT 


3180 


TCCTCGCCAC 


GCTCTGGGTC 


GCCGGCATCT 


TCTTTCATGA 


AGGTCGGGGT 


GTGGGGCAGG 


3240 


GGCGCACGCG 


CTGGACCCCC 


GGGACCCGCG 


CAGGGCGCTC 


ACCAGGCCCG 


TGCGTACCTC 


3300 


TCGCAGGGGC 


GGCGGCCCTG 


TGCGGCCTGG 


TCTACCTGTT 


CGCGCGCCTC 


CGCTACTTCC 


3360 
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AGGGCTACGC GCGCTCCGCG CAGCTCAGGT GAGGGCCGGG CGGGGAGCGG GGCGGGGCCG 34 2 0 

GGGAAAGATC GCGGGCGGGC GGGGCTCCTG GGGAGCGGGA CCGAAGCTGG GGGCGGGCGA 3480 

CGGGCCGGAG CCCAGCGCCT TTGGGGATTC GGTGGGCGAG CCCTGGCGGC GGCCAGAGGA 3540 

AGTCCCCGTG GGGCCAGGGT TGCGGCGGGG AAGAAGCGGG CCTCCTCGCG CCACCTCCCC 3600 

5 GCTGACCGCC GCCCGCAGGC TGGCACCGCT GTACGCGAGC GCGCGCGCCC TCTGGCTGCT 3 660 

GGTGGCGCTG GCTGCGCTCG GCCTGCTCGC CCACTTCCTC CCGGCCGCGC TGCGCGCCGC 3720 

GCTCCTCGGA CGGCTCCGGA CGCTGCTGCC GTGGGCCTGA GACCAAGGCC CCCGGGCCGA 3 780 

CGGAGCCGGG AAAGAAGAGC CGGAGCCTCC AGCTGCCCCG GGGAGGGGCG CTCGCTTCCG 3840 

CATCCTAGTC TCTATCATTA AAGTTCTAGT GACCGAGACC CGGGCTGCGT TCTCTGGGTC 3 900 

10 CGCGGGGGTG GCGCACCGCG GGCTACGGAG CCTGGAGGGG CCCAGCCCGA GTCCGGGCAG 3960 

CCCGGGGCGG GCTTCCTAGT GGCGGCGTGA GAGTGGCTGC GAAGGAACGA GCCCTCCCCC 4020 

TGGGGCGGGA CTGGATCCGG TCTTCACCTC CTACCCCACT CCCTACTCAG CCTCGGGGTC 4080 

ACAAGGCCGC CCAGTCCTGC CGGGGTTCAC CCTCCTAGCG CTCAGCGGTC TCCTCACCGG 4140 

TCCCCCTCCT CAGGGGCCTT CCCTCGACTC TCAGCCGCCG CAGTCCCTCG TCCCCTGGCC 4200 

15 TTCACAGCTG ACACTAGATA GAGCCTGTGG CTCTCTCCCC AGGTGAGGGC AGGGGTTTTT 4260 

CTTTTGGTCA GCACTGGATC CCCCTCGTTA ACTGTAGGTG TTCAGGGCAG CCCTCCGAGG 43 20 

TCCGCAGAGC TGCGGGCACC ATGGGAACGA AGTGAGTCAG TGACAGGCGG TCTCAAGGAA 4380 

ATGTCCAGAA GCCTTGGGGA TCCAGGGGAG GCCCACAGAA ACAAAGAAGT GACTTTTAGC 4440 

CAAGTATGCA GGAGAAACGG AGGAG 4465 

20 

(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 30 base pairs 
25 (B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
30 (xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2: 

GGGGCGGCCG GGGGCGCTCC AGGCGGGGCA 3 0 

(2) INFORMATION FOR SEQ ID NO: 3: 

35 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
40 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

45 CTTGGACAGG TTTCCTCCTG GCAGGGTGGA 3 0 

(2) INFORMATION FOR SEQ ID NO: 4: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

GGGTTGCCAG GAACAGCCTG GATGGGGACC 3 0 



(2) INFORMATION FOR SEQ ID NO: 5: 

15 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

20 

(ii) MOLECULE TYPE: other nucleic acid 
(Xi) SEQUENCE DESCRIPTION; SEQ ID NO: 5: 

ATGGTCCGAC GGGAGGTCTG GGGAGGGAGT 3 0 

25 

(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 3 0 base pairs 
30 <B) TYPE: nucleic acid 

<C) STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
35 (Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

CTCCTGCCTG GAGTTCTGGG TGTCTCCCTT 30 

(2) INFORMATION FOR SEQ ID NO: 7: 

40 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
45 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 



WO 99/10529 



-5- 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 
TAGTCGTTGT AGGGGTTCCA TGCACAAGGT 
5 (2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 0 base pairs 

(B) TYPE: nucleic acid 
10 (C) STRANDEDNESS: single 

<D> TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

15 

TAACTCCTCC ACCCACCTTA TCTGTTCCCG 
(2) INFORMATION FOR SEQ ID NO: 9; 

20 (i) SEQUEL- CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

25 

(ii) MOLECULE TYPE: other nucleic acid 
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

GACCACACAC AGACCAGTGC TGGCTGTGCA 

30 

(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 21 base pairs 
35 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
40 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10 

GGAGTTCTGG GTGTCTCCAT C 

(2) INFORMATION FOR SEQ ID NO : 11: 

45 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 21 base pairs 
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(B) TYPE: nucleic acid 

(C) STRAi.'DEDNESS : single 

(D) TOPOLOGY: linear 

5 (ii) MOLECULE TYPE: other nucleic acid 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11 

GGAGTTCTGG GTGTCTCCAT T 

10 (2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 
15 (C) STRANDEDNESS : single 

(D) TOPOLOGY; linear 

(ii) MOLECULE TYPE: other nucleic acid 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12 

20 

TAGGGGTTCC ATGCACAAGG G 
(2) INFORMATION FOR SEQ ID NO: 13: 

25 <i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

30 

(ii) MOLECULE TYPE: other nucleic acid 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13 

TAGGGGTTCC ATGCACAATG G 

35 

(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 21 base pairs 
40 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
45 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14 



TAGGGGTTCC ATGCACAAGG T 
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(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 
5 (A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS: single 
(D) TOPOLOGY : linear 

10 (ii) MOLECULE TYPE: other nucleic acid 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 15 

TAGGGGTTCC ATGCACAATG T 

15 (2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 
20 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16 

25 

CACCCACCTT ATCTGTTCCC T 
(2) INFORMATION FOR SEQ ID NO: 17: 

30 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



35 



(ii) MOLECULE TYPE: other nucleic acid 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 17 



CACCCACCTT ATCTGTTCCC G 

40 

(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 21 base pairs 
45 (B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
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(ii> MOLECULE TYPE: other nucleic acid 
<xi) SEQUENCE DESCRIPTION; SEQ ID NO: 18 

5 GGAACAGCCT GGATGGGGTC A 

(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 
10 (A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
<D> TOPOLOGY: linear 

15 <ii) MOLECULE TYPE: other nucleic acid 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 

GGAACAGCCT GGATGGGGTC C 

20 (2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 
25 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 20: 

30 

CAGACCAGTG CTGGCTGTAC G 
(2) INFORMATION FOR SEQ ID NO: 21: 

35 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



40 



45 



(ii) MOLECULE TYPE: other nucleic acid 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 21: 

CAGACCAGTG CTGGCTGTAC A 

(2) INFORMATION FOR SEQ ID NO : 22: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
5 (D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: other nucleic acid 
<xi> SEQUENCE DESCRIPTION: SEQ ID NO: 22 



10 GGTCAGTCTG GACTTTGCCA C 



(2) INFORMATION FOR SEQ ID NO: 23: 



<i) SEQUENCE CHARACTERISTICS: 
15 (A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



20 Ui) MOLECULE TYPE: other nucleic acid 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 23 



TTGTTACCTT GAGGCAAGAG G 



25 (2) INFORMATION FOR SEQ ID NO: 24: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE : nucleic acid 
30 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: other nucleic acid 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24 

35 

AGGCTGGCAG GCATGAGGTT T 



(2) INFORMATION FOR SEQ ID NO : 25: 



40 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

45 

(ii) MOLECULE TYPE: other nucleic acid 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 5 
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TTCGTGCCCC TTCCTTGCCT A 21 
(2) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 26: 



CTCCAGCTGC TCCTGCACTG A 



21 
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|2> VJ^ORMATlCM FOR ID MO.' .1 ! 

30 (i) &33£20KNC:bS CH ARAC'J'JSRLSTI CB ! 

(A) LENGTH: 4465 tutu* paixc 

(B) TYPE: rtuCle-ic acid 

(C) GTRANDEEHESS : sirtgie 

(D) TOPOLOGY : lUfcAr 

(ii ) MOLECULE TYPE: DMA < genomic) 

(x-,) SEQUENCE DESCRIPTION: SEQ ID HO: 1: 

UAUCTCMaiR fcGCCCCCADC TCISGGCATAT CTGGTTTOCG GGGGCAGGGG COATACCCaO 60 

40 ^OAGGAAr* AGW^TTCTG AGAGGCTCCG AGCCTCAGGC TGl^CTGAG 130 

IM'I^CHXXIW? OAAPGAAGGA DCAGGTGCGA GGGCAGAACC ATGCGGCCCG ACOCCTGlCA^ ISO 

CALMJUCJrTCT GGCHTCCCCC ftGCTCCTGCTC CGTOCTTCTG GGTCAG3TCTG GACTtTGCDA 24D 

tm.*t^tywr:C^ ^ftAWCACCX? EAAACDCftCT CAAGCCAAAA GAG&AAJGT3A CCGtfTAGOOC 30 D 

t^UWiTQGCSA* TOCTGGOGGC CAOW3BCACT CCAGGCAGOG CX3JWGGGGGC GGCCdGGGGC 36D 

45 iracrJMXiCX? GACACDCAOA AGtCGWSGCA GGA0TCCTOG GCjT^CAGCT 42D 

TTC^CPCTCCA CCTC-^CCCTG OGTGG9CTC7T GTOCTCAG9G TCGCCOGCCG TAttltCCCCT 4BD 

WrCCAC.TCTR ^.TTTTCCTGT CCCAAJ^TOC TAAGGAAGTT TCXftSAAC'T* DATCTCAOCA 54 D 
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TCTTQAEJTCA CsnrTTe-'rn^ AOTGTCCATC TCACAGGCX7T GGAAGGGGtA UOAErTCAUCA <SOG 

ctctlxzacac CA*>ADGGtx/r GA"TGTr2c;c:n; A-r*3GrAGCCG TCTAGGAAGG aWjfcOAAGG 6 SO 

•i-nflTTACCVT aAGtiCAACJAU CtiTTRCW^U ElAGAAAGACA CAGCAGGTGA CTGTI'G'MGG: 720 

/VGGCOCAAGA CiAGGCCTGGG AUAd£OAT<a& Ct>CAE?AArasn CTGACCCTCC OGCCACOCAG 7 BO 

5 Cy&GGCCTTGG A C AG GTTTCC TttM'GGC'AGG' Q'KiGLOCTni TEXTATSGAAT CrcTACAAOG 540 

ACTAAGGCTG GCAGGCATGA GGTTTCCTGA A^is^iAAGA CtTTVTl^TO CCCAGTGTGG 9 DO 

CTGGGGGGGC GCTOGGACOX: CATTCTGAAG CCAAAGGCaC TGCGAAGCinr: TTCOGCftGAS 9&0 

GAGGGTTTGG CAGGGGTTGC CAGGAACAGC CTGGATGGGG ACAOOOAACZA MTA^GTGG: 1020 

GTGGAGGAGT TAGC05GGAG CCTG0GQCT6 GCTCCAGCAT GATG^-CJlXJOO TrrraCAAGGE? 1 OfcO 

10 CCTGHiftGAAA aTOTSr^TGGT GCAGCAGSGG GGAGAiCCCAT AOCTGGAGCO' OtACEZCAEiATr; 3 14 0 

GAOAEXTTTCUS CtrPCTTrxif^C GCX^RACTAG GCAAGGAAGG GGCACGAACA iyt^AtfOAACT 1200 

nirrciAatscca Ttrreti&aErTA acTc?rrcTTcr; crTTCPCCftGA C1TTGGTTTC OTCOerscrg 12 £0 

E^rrtiGCC-rti GCK:C:CTtitf£ TOTC3TC:tc?C2T ATCJnTCAC&C COCCCTGCAC CCCCrcCACy 1320 

CiA&A'TGG&GC GOGGAGAGtA CCC^Al^CTQC TCrrrcCTETC CTC?E?5COGTC CTCTGAQCAG 13 B0 

15 tAGACGGGGC TAAGCOTTCC CCAjaWOGCO *i' U*OA CACACA <lClTXTf5?TrsCCA CCACACCGAC 1440 

GGJTACCATSjA AGGACGAGGT AGCTCTACTG G«XiCTO'J<CA CEX^rxTTCCX? AGTCCTOCTG 15 DO 

CAAGGTGGGC TGGTTOCTAT CTAGGAA6AG GGTGGGCC^T Al*ATLX:trrAH AGCTTGOCCT 15&0 

CTGCOOCCTA GGOCCAGGTG GAGGGCAGAG GTCGGX3ACTC CAGClXlftjC3t2C: CPAATTCT^GA 1620 

AGA^rXTTf^G G A GTTTCAfSG GAACTGGGGG GCACCTGGCT GI^AGADC 1 !^ TAOEiACTHSn: 1*9 £0 

20 aEiCTESCOAAO aCTClCI^QRA CAAATGGTAE GRTAGOCATG GGCTTGGGGA A<iO'lXiATCTC 1740 

TdCttcmwcc AdL-itau^eot^: tcthtcscx^u tcccagcrag cggcccccat tccctqdotc isdq 

TOCTU^CAAAJQ GCACC'JtCAT ACTUCJGAt^A crfTCCAC^Q GCTAGAjOGTG GGACTCCOTC 1660 

CTCV2AGCCHX' CTAAAAAOAG etTflfrrrAAT atX-TCTTCTCA GACTGGOCCT AAAGGACACA 1920 

TTCCTTGGJCC AGATATCCTT GOCACCTAAO AtiAOAUiAtrr A(rTCC*C*GT GTGTGGGCTA I960 

25 GGATAAGGCA CAGCCTGGGG AGGGGGCTCT OAAG9Q<3C^^ AACAEW.ra<?r5 CCAGCCTGAC 20-10 

CTCCAGCTGC 1CCTCCACTG AGCTO3ATGG COACCCTOOtJ ACACEXTATCT GCAGftGGGCC 2100 

CAOAACCAAA GGTCCCAGGG CTGCAGGACT CAGGGGClAC*A UWJTCCIlACn G^GOT^TCG 2 ISO 

GGAGGGAGOG CACAGCCAQC ACTGGTCPGT GTaTGOTCTG Gtt¥lMK&TC ArX!TG*CC*A 2220 

GAGAV3GGCT CCTGCCCACA OAGAAACTTT AGGGCCAGCt CAC^Cl'CJXSC: AAOTACCCCA 2 2 SO 

30 GCDCTGGGGT CC^GQSTTA GGCI'AGGAGA GTCCCAGCTG CAJhC^rtiCTQ (3E»GCAI*?AI? 3340 

AGAAGGTGTC TGTCAGATTT AGGCCTCGGA CCGPGAATGCA GOAACAOAOA AAC!T^AI^TT ^400 

TE^QAOTCACA GGGACGCAGG CTTTAGTGJ'.T CCCGGCCTGA GGCAD^E^JXl^ UACKJC^CCCrrn 34-60 

c■rW7^f?G^5CG CTHSGTftGGTG CGTGACCAGG GACTGO^ASC TACAGGGfAGT Q'JXJCTTCi^TT 2520 

C2EZACCTGI5GA OTATO^GDC AGCTCTCDCC TCAGACTCCC G A<3G CACJfT C* CT^GCCAQOD 2 5 B0 

35 ACE^mAAAGt: TGHATTTGnc TCZTC5TTTTGA G?v5TGAAA t JC ATTCAGAAAC AAGGACTCAA 2640 

GTGOTt>l*eiM- T^E>Lt3ESA0CA <idTt3TCCE^nj T^E^T^AAIC ACTCACCCTC COCCATAGAC 27 DO 

O^CACAG^T'Kl GGACA&GOeC 'J-C'l^JWSCl^CC CEZAanrTTra GCQCTGCOCT CCTCGCTGAA 2760 

TCTCAGGGAC ACAGGGCAfrG CO^OG-A'KiQ (^T^AElACCZAG ftGGTCTCCTTC GGGCGGGGAG 2-^2^ 

GGGGC-GGGGT -TCCXjCCTTAG C-OAGGAOACid ACACuiicCAA CT^AAGGSCC AGATTGCAGG ?,8Bi l 

40 ATCCCTCCCA CTC'CCAI^I'-C: KSGGGCTO^ OOl^TC^AOA CCTGACTCCC GCOCCXXCrTC 2-940 

CTCCCCCA^C CTACTTCTCC CTGCAG^I^A 'It^lxX3l3CE3[X: CATOGCCTTC CGCGTOTOGC 3000 

CGOCGCTCM: CAC'CGGCCCA CCCGAGTTCti Z&CGC&rCTb rX^QW?C5CCAG TGSAGGCGCX5G 3060 

CGGGAGGGCG CGGGGCGGGG AGCGAGCC'tX: aQl^L^tlx: <x:c^TTX?CAGG ACCATCCOGG 31SD 

CCGWC^CGCT CATC^C!ACCD GCTCAC-COeA Gd^l-OAAC'l^ CAU7GAGTAC TTCCOGCTGT 31 SP 

45 OCOTCGSCtlAC GCTCTXiGGTC GCC^CATCI 1 ■l^'i"^TCA«l , ^A AGCrTOGGGGT GTOGGGCAGG ^210 

GGOGCACGCX3 CTGGACCOTC ©GGACCCGCXS DaOGQC"GCTC A0C»JQW7OCG TGCX3TACCTC 3-^00 

7CGCAGGG(yC GGCGGCCCTG TCC9GOCTGG I'CYaCCTGY? OGC^iCCCETTC CGCT'AC^T^Z ^ESO 
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CAGCTCAGGtf 






tX?CGGGi>CCQ 










GGG©CTDC«I>3 


G3 GAGCGGGA 


CrcAAGCTOG 




J 4*0 








TTGGGGATTC 


GGTG3GOGAG 


COCTGGCGGD 




3540 








TCXWOGG^? AAGAAG03GG 


CCTCCTCGOG 




3600 


5 


GCTGfcCOGCC 








GPGDGCG5CCC 


TCTGGCTGCT 


36S0 






GCTGCGCMOG 








TGCGCGOCGC 


3720 




GCTCCTCGGA 


OGGCTCOGGA 








rXCGGGCCGA 


*37*U 








CGGAGCCTCC AGtL'TGOOCCtf 
















G AC og a gac c 


d2*3GCTBCCT 






10 




GCGCACOGCG 


GGCTACGGAG 


CCTGGAGGG6 






3£«0 






nrTTCCTAXST 


GC-CGGCGTGA 


GAjGTCJGCTGC 






4D20 








TCTTCACCTC 


CTACCCCACT 


CCCTACTC.V^ 


Cti¥Bti(IC3CTL s 


40AQ 








TOGGGTTCAC 


COTCCTA3CG 


CTXZAGCGGTC 




41X10 










m "AGC7CGCCG 


CAGTCCCTOG 


T-CCCOTGGCC 


4200 


15 


TTCACAGCTG 








AGGTCAGGGC 


AGGGGTTTfTT 


4»<S0 




CTO*FTGGTGA 


GCACTGOATC 








CCCTCCGAGG 


4J20 






TOCGGGCACC 








TCTCAAGGAfc 


4 5*0 






GCCTTGGGGR 


TCXZaGGGGAG 






GftCTITTAGC 


4140 






GQAGAAACGG 


AGGBiG 











20 



(2> I >f JKDKMAT I Ofq FOH SEQ IP NO; 2: 

fJi) 1JKSNC3TH: £0 fcaFB pairs 
25 ID) TYPEj r*uclcir? ani.rt 

LI» T0P0LOGV: linear 

(ii) MOLECULE TYPE: DLlier nucleic acid 
30 (xi> SEQUENCE DESCTKIPTIOWr SBQ IL HO: 2: 

aCK&toCJUCCR COGGCGCTCC AGGOGGGGCA 30 

<2> INVQKHATIfW FOR SBQ ID MO: 3; 

35 

1 3L > ^UEMHE nHABACTBRISTICS ; 

^ LEwaTH: 3-0 bdB* pairs 

EC) tl'J'tlhjgOBDNKaBj sing-Lo 
40 [I» TOPOLOav; linc/ir 

Cii? M OIL-ECU LB 'Wras other nur.lrcr.^ <? 
Cxi J SEQUENCE DEXSiDJrM PTKXaj tfB£ 7p *K>: 3 ■ 

4^ CTTGGfcCAGCl TTTCC'TCC^ a<lA<;rtt3T£C2A 30 

fS) INFORMATION FOR SfeQ ID NO : 4i 
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TYPlS- cxucleiu acid 

<d> topology: lii^ea* 
ill} MOLECULE 7WE; otter nucieic acid 

10 

GWrrocr^cs tiAJtfiaccOT:: ciatoi^ciatic SO 

1 5 Li) SBQUENCB CHARACTERISTICS : 

CA) LENGTH; SO base pairs 

(tj3 topology* liiu^T- 

20 

Mi) MULSraHLE TYPE: ot-hnoc nucleic JSC id 



atogtcmxiac GGCtfiaorcTG ggoagggjwst 

25 

(2) rWFOPJfATIOW FOR SEC? ID 1X>: 6: 

(i) GEQUEKCE CHAI^ACTBRIGTICS ; 
<A| LOENOTH: 3D baae uaire 
30 <B> TVTE> nucleic acid 

<nj TOrobor^y ; linear 



3D 



fiil M?LESCOT-JP! TYFK; nt^r CH^lel*? aci(3 
35 |x.i| f^KyirBMCE DE^TRIPTIQW? 3EQ ID ND; 6; 

CTL'CrTCiC.r.Ttl CWZTTCTKtttt TKlXyTCQZTT 30 

(2| iJIFtiiKttlvriGW iTClR ID Wftt 7: 

40 

f.fc) JUBMaiU: 3D fence paira 
(B) TYPKi nucleic acid 
tC) Sa'kAHC*EDHE£S i single- 
45 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE; Other jwcLcic acid 
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^xi'| ^iiOUE»^E DEBCRIPTTW; SHft TD WO! 7; 

TAGTCfWOO' AflCiGflT'llXlA TiXJACAAG&T 

5 (20 INFOKMATIQH FOR SEQ ID HO: 6: 

Til .^tnfWCKJ CWAPAGTEHIOTir^^ 
4 a} lenrtH; £Q biw* pairs 
{K) TYPE; nu^lFfcio no id 
10 &TRAtiU3EDWEa«: single 

JL>> TOPOLCdY: linear 

MOLECULE TYPE: L>bh«r nucleic acid 
{Xi> SEQUENCE fl&SCTUPTJQHs SEtf ID NO: B: 

15 

T^^lCTCCTCC AOCCACCTTA TCTGTTCTCS 

20 |i) SEQUENCE CtUKAGTi£K2£lTi:C9 : 

(A) LENGTH: 3D fcacc pairc 
£B) TYPE: nucletic at: id 
(CJ STRWSIDfEIKMESS : elicit 
<DJ TOPOLOGY: linear 

23 

(iii JMOt*CUT.F TYPE; othffif nu^Leic acid 
(xi.) RT*2U*?*:j? DESCRIPTION: 3D NO; * ; 

GilOLIAUACAC ACACCA^TQC TGCCTf^TQCA 

30 

\2'f IWFOfLM/k-J'lOW FOR SB£> ZD MCJ* 10 f 

(i) SEQUENCE ewu^tt&feld'riCg! 

35 C&l TYPE:- nucleic ocid 

(CI STR3^E3D«ESS: fcifl^le 
(D* TOPOLOGY: linear 

<1iJ MOLECULE TOPE; Otto** nucleic acltf 
40 <ir5 ) SEQUENCE DESCRIPTION; SEfc ID MO: ID 

^GACTTirrTO GTGTCTCCAT C 

(2) aNPC3KM>T70fq FOR SEC 1 " T Tj MO: 11; 

45 

(i| &SCH7JSGIIL7.E nHAHArTTBRL^TICR; 

\A) LKHCTH! 21 banc pair?? 



-<5~ 

IB J TYPE ; nucleic acir2 
(P) TOroiXX^; linear 
5 MnLEnjL,E typr; ether nucleic a-cid. 

GGAGTTCTGG GTGTCTCCAT T 

10 ^? JMFOHMA'riaN FOR SE£ IP NO: 12: 

CHARACTERISTICS; 

(B] TTKB; nunltf-ic: acid 

(Dj 'YOfOIuDUY : linrar 

ili] MOLECULE ¥Yfc*K3 other nun-loir: arid 
<Xil SEQUE1TCE DESCRIPTION! ID MO! 13: 

20 

TAGraGGTTCC ATGCACAAQG G 

(2) SNKSRXATZttN POK 5*2 IP WO; 13 r 

25 <U SEQUKiX.^ CKAKACJTERJRTir.Sf 

<M LENGTH: 21 bacc pairr. 
<DJ. TYPE: nucleic *eid 
{CJ- £TftW*DEDtflE££: circle 
{D} TOPOLOGY: linfetsf 

30 

111} MOLECULE TYPE : Other nucleic aci<3 
(Xlf SEQUENCE DESCRIPTION: SEO ID NO: IS: 

TAttl^TTCC ATGCACAATG G 

(2\ INFORMATION ID NO J 14; 

\A't LE2NGTH: 21 boj&n pairs 

40 [IO T¥VjK: nucleic acid 

\Cl) gTRJUHDEDMEaWi sing].* 

fD) TOr01/X?Y! linear 

(ii) MOLECyi.F: TVT'FJ; Ot.h-W" nucleic acid 
45 (xi) «B£OEMCR np^CRTPT^Of?- IP WO ; 14 i 

•FAGGGGTTCC AT GO h D T 
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(2) rWPGRHATrOK FDR SED I'D M): 15 l 



(i) SEQUENCE aiW^TERISTJCS? 
5 (A| LEMGTtS: 21 <baae pair* 

(PI TYPE: TiXiOls-ic acid 
(CI STKANPEiDtaEC?; single 



10 -jiil J*0liKOUJ_J4 TYPU! ofchnr nucleic acid 



TAG GG GTT OC *TX?CACAftTG T 



15 (21 INFOKKATJOQW FOCR 5EQ ID MO; 16: 



<Ej TfPK ? nuclei n raci^d 
20 £TRANDEDWIS£& « single 

(D> TOPOLOGY: linear 



UH MOLECULE TYPE: Ofchex- nucleic- ani.rt 
txi| SEQUENCE PTIGN : STEQ ID 1& 

25 



Ml JMTORMftTION FOR SEQ IP NO; 17: 

30 (i \ SEQUENCE CHAF^OT ERISTICS: 

1.BNGTPK; 21 baser pairs 
{J*} TYTE; nucleic acid 
^^n-PEPNESS; Simple 

35 

[Li J MOLBC'ULiE 1*fclfl! Dtlier oncl*? C ftcd<3 
(xij. EECJUEWDE DESCRIPTION: £KB(? I.D ND; 1? 

CftCCCAOCTT ATCTSTTCCC G 

40 

INFORMATION FOR SCO ID KO: 19: 



[A) LENGTH: 21 bnee pai,rs 
45 fm TYPE: nucleic ani.tf 

STRAMOEDNESa- single 
CD) TOPOLOGY: linear 
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(:'. i? MOLECULE TYPEi otfce*' nucleic acid. 

.SEQUENCE BB5CRI FTIOW ; SEfc ID *K>; 1* r 

42 > IKiFOPidATIDW **L>K KEt? IC DX3 ; 19 i 

[13 SEQUENCE CKfcJ^TltfU&TlCg; 
10 (A) LENGTH: 21 haec paijrc 

(B) TYPE: rtuclt=ic -sseia 

(Dj TOPOLOGY; linear 

15 (ii) WDUSCTULE tyft*; othar nuclei r. nni.rt 

{xi) SBQfUENCK INSCRIPTION ! EEV 1 ID »U! 

2i) 12) IWFOFMATICtt? FOR GEQ ID MO: 30: 

(i) RKpUTTOCE CHAKACTKRISTICS: 
(a| i.TCNGTKi 21 pairs 
i:ei tvpRs mxa*1ct acid 
25 sntwmETJKRSSr siagle 



30 



4 ill HDLE£UOS TYFiS; other nucleic? aoid 
\X±] GEQUENCE Ute&CJUPTIOW: SKD lb MD> 2D: 

HAfJACCAGTG CTGGCTGTAC G 21 

(2) IW^KMATICM FOR SEQ IB HO; 21: 

35 til SEQUENCE CHABACTEHISTICS: 

W LENGTH: 21 base pairs 
<eJ type; uncle* c acid 
W stf*m>ei3NE3S; a ingle 
TDPOIjCXT* ! D linear 



40 



C 11 J MQLfeaCUUS TYPE: Dtticr nunUiQ ;acid 
[Xl^ SEOlJiSW^H: EXESKHR I FTIOW ; IB JqO- 21: 



GftGACCaGTG CTGGCT^TAC A 

45 

W\ IOTGEMATIQM POK £i»y ID HO\ 22; 



21 
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I A.) LENGTH ; 21 jjairc 
IB J type j nucleic aci£ 
ICi 5TRAWDEDNE5S ; SirjQie 

(ii) HUL/KL v Li.LB TVPEl othnr nu^l^icr Rcid 
(xi) DfcfcCKrPTlUWj .RE>2 IJJ NO: £2 

LO &GTCv a >3TC'TG <3AOTTT3CCA C 

13? IJ^KKftTrCW FOR 5EQ It> NO; 23 : 

<£» TOCOLOGY? linear 

20 Ui| M&LECULE TYPE: DlJl6i- nucleic acid 

lxi> SEQUEtQCE DESCRIPTION: SEQ TD NO: 22 

TTGTTACCTT G^GCAftGAG G 

23 (21 INKDKMftTTOfKI FOE II> NQt 2i; 

(A) JjvBWCjthi 21 bace pairs 

(B) TYPE' nucleic aciiS 
30 (£) STE^iWDEOMEeS : ^-in^le 

(DJ TOPOLOGY: lifter 

MDSuECUIrE TYPE; other nuclfeic ooid 
(Xi) SEQUENCE DESCRIPTION: SEC ID WO: 54 

35 

AGGCTG3CAG QCATGAjGGTT T 

(2) ZNFOKKATIOM FOR SEQ LD MO; 25: 

40 ('I | SEQUENCE CHAF4ACTER I ST1CS : 

^> LENGTH ; 21 bas* pairs 
{B> TYPE; nucleic acid 

(C) ,U TRAi'XDB dw E 5 J3 s single 

45 



fix 1 HDLEroT.B TYRE: other jiunl*ic acid 
(xi) iJKpUSMCR ^SCRt^PJlOw : BEjQ ZFTi WO; 25 
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TTetrrucccx: TTCtrrrrxxrr a. 21 

(2) IW^OkJiATIOW PUR KEO ID 2<J i 

5 

<A| LEt'lOTO] : 21 ISAee pair* 
CB| rfPEi nucleic &cid 
(0\ STOANPEXMESS ; single 
10 <d| topdtj>tv: linear 

JtOLdfiCUl^K TYPE : otJicr Tmcil*io 3Ci<* 



15 CTCMOPGC TCCTGCACTG A 



21 



